
The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: O/Pif 



Date Processed by STIC: 1^(^/1^0/9 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) „ 
PATENTIN 3.0 e-mail help: patin30help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3,0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



jfcw Sequence Lis^^lBrror^iminary 



m ^* * 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER; 

^^a euti' . r , ^^ 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

Wrapped Nucleics The numberAext at the end of each line "wrapped" down to the next line. 

This may occur If your file was retrieved In a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

Wrapped Aminos The amino acid-numberAexl at the end of each line "wrapped " down to the next line. 

This may occur if your file was retrieved In a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping''. 

Incor/ect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



Misaligned Amino Acid 
Numbering 

Non-ASCII 



Variable Length 



Patentln ver. 2.0 "bug" 



The numbering under each 5th amino acid Is misaligned. This may be caused by the'use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing^twfen the numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text^o that it can be processed. 

Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the _ 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



Skipped Sequences 
{OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTER!STICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response !o include the skipped sequence(s). 



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
«400> sequence id number 
000 



10 ^ Use of n's' or Xaa's 
(NEW f&Lft)* 



Use of n's and/or JCaa's have been delected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n oi Xaa represents. 



11 Use of <213>Organtsm Sequence(s)_ 

(NEW RULES) 



are missing this mandatory field or its response. 



1 2 Use of <220>Feature 

(NEW RULES) 



Sequence(s) are missing the <220> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artificial" or "Unknown" — -J * " 

Please explain source of genetic material in <220> lo <223> section. 

(See "Federal Register," 6/01/98, VoL 63', No. 104, pp. 29631-32) (Sec. i.823of new Rules) 



13 



Patentln ver. 2.0 "bug" 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 
*fiIe*TeTOIting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 
Instead, please use "File Manager" or any other means to copy file to floppy disk. 



/ 



AKS-Biolechnology Systems Branch- 5/15/99 



Page 1 of 7 

OIPE 



c-- 

C-- 



3 3 
'5 4 
1 5 
37 

3;< 

3 9 
41) 

■1.1 

-! 2 

4 3 

■!fi 
4 C , 

4 7 
•■13 
49 
50 
51 
52 

5 3 
5 4 
55 
56 
57 
53 
5 9 



RAW 

p / ■. 7 



^ 111 ' 



SEQUENCE LISTING 

;M'f APPLICATION: US/09/7 16 , 964 



DAT 13 : 
YTME: 



V<>7/2000 
' : 50 : 0 3 



ijf. i 



A: \R1030L .app 

N:\CRF3\12072000\I716964 .raw 



Does Not Comply 
Corrected Diskette Needed 



<110> AP 



V"il i 



O ' norm ell , 
Alexander 

]ya 
Da v id 



Michael K. 



a 

"i u 
ii 
i 2 
i- 1 - 

16 
17 

19 
2i: 



PRIOR 
PR TOR 
P R 7 OR 
PR TOR 
PU.LOK 
NUMBER 



L3 0> 
<140> 
<14 1> 

<i:hj> 

<15.1> 
<15 0> 
<151> 
< i 5 0> 
O.r- 1> 
•;.160> 
<17 0>- 
C210> 

<2;i i> 

<;212> 
<21'3> 
<4 00> 

tceyqyngt;.o 
gecacctccri: 
a f ;tagcc tty 

ycct.a-.:ctc:t 
atgycyy Lyy 
g!:geagaqyg 
yaygacy t.yc 
q tot to a tec 
accctyyayy 
atge coceea 
yayategect 
gcoctccLcc 
yaycyotteo 
Lccceceoag 
goqyeectgy 
leygycc L LL 
f.'COCM.cr.;;! 
gag oqocY.eq 
ngggceclgg 
f. eCfjaycCqg 
egyyaycyyt 



' . J CAN ! : 
ha .kOV , 

leva, i" 
J <•» m i a .ni'.i , 
Bruek, r.riiia 
Kii.i'i.;. an , John 

TTTr.K OK INVENTION: FN/.^MFS DERIVED FROM THERMOPHILIC ORGANISMS 
F U NC 1 I O N AS A CH ROMOSOMAL RLPL 1 CAS L , PRL'PARAT ION AND 

FU.F RFFFF.ENCE: 2>>:21/1030 

CURRENT APPLICATION NUMBER: US/09/716,964 
CURRENT FILING DATE: 2 0 00-11-21 

PRIOR APPLICATION NUMBER: GO/14 3, 
iy97-04-0« 
NUMB tIP : 08/82 3, 

] 997- 04 -08 
NUMBER: 00/057, 
19*33 -04 -0B 
NOS : 212 



FIX! NG 
APPLVC 
TIL in. - ; 
appt.tc: 

11 L J.MC. 
OF SFO 



DATE : 
T'lON 
I j ATE : 
TION 
DATE: 
ID 



,'02 



4 07 



SOFTWARE 
5 FO Z TJ HO : I 
LENGTH: 2 0 07 
•i i'i'E : DMA 
ORGANT SH : The mm 
SFQUENCV;: 1 

c.-qytt.c:c:cag 

tgageycer; (: 
a eg ty aagga 
tct(.-cy'.igt:c 
ygtgc^tgyy 
yog coca ccc 
gyyagctyag 
Log a eg a ggc 
agecc^eyoc 
ccrJ t;r;eLc !.C 
t taayeteeg 
t ^'COCg 
tcot."- : tg 
gyaecygyg L 
gee tcgoccg 
t.yyaggly L t 
eeocqecetvj 
crc<\:r.q<~ : o 
cogecgayge 
a.'iogcrcccc 
ggcyggectt 



ten tin Ver. 



t her mop hi 1 us 



g tag a ccccg 
oaaggc^tqr 
ot.acegocgc 

'jCCCCt.OCtC 

cagygqegtg 
ggaagacccc 
y yaogtggtg 
ggnaagga Lc 
ccaca tgctic 
ecacgtccte 
ceycaceeag 
gcyea i.cctg 
cctgycggac 
:iqgecCCCLC 
ggecgaga Lc 
gcgcctctac 
ccyggaayyc 
ggooetqatc 
egaegcotta 



geecctoceg 
«aggagagga 
tkccgcecec 
aaggccat.ee 
gg« - :aagacca 
ecttgeygyg 
yaea ttgacg 
cacctcyccc 
tecaaaageg 
ttcytettcg 
cacttccyct 
gaggc^gtgg 
ggggee c tta 
acceggaagg 
gccgcctcoc 
gggqaagggt 
ctctacgccg 
gccgccatga 
agcctggnyg 



tgagcccctt tacocaggcc 
acgtccgcac cacgccctat. 
tea net tec a g g a g q t gg t g 
ggyaggggag gctcycccag 



c c L a c c c c a g c c c a eg g g eg 
g a c c c c g g a a c c c e c a a g go 
cctcgaggcc ctcaggccca 



ccacggcgag 
tetyecccca 
ecgecagcaa 
ccctcLctgc 
cetteaaege 
<;oaccaccga 
tccgccgcct 

gy^ggyayyc 

gggjcgcgge 
aggtggagcg 
Legcgagggg 
acqecccgag 
ccttcygcct 
ccgceetgga 
tggece tcct; 
ctccttccce 
ccgaggaggc 
ccctacgggc 



gctcotcgec 
ctgceaggcg 
eaactcegtg 
ccccaygaag 
ccf.cctcaag 
geccgayagy 
oaeggaggag 
ggaggaggag 
aagcetcctg 
cgcecUaggc 
gaaaacggcg 
gagect gg t.c 
cgcygyaacc 
cgaggecatg 
ggaggeggga 
a'gagg tegge 
gccoy.-iccty 
cLtcgtgcgg 



60 
120 
.180 
240 
300 
3 60 
a 2 0 
100 
.540 
600 
660 
' ? 20 
7o0 
H40 
900 
960 
1020 
1.000 
1 3 40 
.1 2 0 0 
1. 2 6 0 
1320 
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RAW SEQUENCE LISTING 

PATENT APPLICATION; US/09/716,964 



PAH/: 12/07/2000 



Input; Set. : 
Output SeU: 



A:\R10J01 .app 

N:\CRF3\12072000\I716964 .raw 



SO naggoc^qec cgqaggtccg ygaagyccay cl.rtqcct.cq ct. t.Lccceqa aoaeaayycc 

hi LLccacL.acc yoaaggcctc ggaacagaay gLgaqgctce LoocccLggc ccaygcccat. 

o'J tt cqcrqqLqq aqqaqqtcgt ccLcqLccty qaqyqaqaaa aaaaaagcct yaqcccaagq 

6 i c c c c f r c c 1 : q q c r. c c a c c t; c c t q a a q o g c c e q c a c r : c c r s y q g c c c r . c c c 9 a g ** j . ■ 1 9 q a q q t ;■ i 

6 4 q a r, y c g tj a q q a a a c y g c g q a g g a g g c c c c < ] g a q q a q q c c V. t; g a q ■; 1 c y q g r. g q r. ; ' q c c t c 

b 5 clygyyy g g c y g q t y c v. c t: g g g L g c y g c ■ q q c < : c a y g a ; .■ c 0 g g g a g g c g c c c ] y q g a g y a a 

fib cccct.gagcc aagacqaqat aygqqqt.act. gg La Lata at qqqqgcatqa cq^qqaccac 

u7 oqacrtcyga eaagagaocq tqyacaacat ccLcaagcgc ot.coqccqLa tt/iaqgycca 

6 8 gqtqcqqqgg ctccagaaga tggtggccga yggccqcrcc tqcqacgagq t c.-.tcacrca 
09 gatqacegcc aeoaagaagg ccatgyayyo ygrgyccucc cl.gat.cct.ee a-.-gsg Lt.cci 
70 qaacgtcLgc: gccgccgcigq tctccgayyg caagyLgaac cccaaqaagc ceyayyayat 

7 i cqccaccatq ctgaagaact tcatcta 
7-1 <210> SL'O JO NO: 2 

~f: <2IT> LENGTH: 329 
<212> IYPL : PHI 

7 7 <2}'*> ORGANISM: fhernms thermophi J us 
?'j *00> SEQUENCE; : 2 

B0 Hc-V. Ser Ala Leu Tyr Arg Arq Phe Arg Pro Leu Thr Phe Gin Glu Val 

hi .1 n 10 15 

S.'i Vai Giy Gin GJ u ills Val Lys Glu Pro Leu Leu Lys Ala lie Atq Gin 

84 20 25 . .'i0 

8 6 Giy Arq Leu Ala Gin Ala Tyr Leu Phe Ser G.ry .Pro Arg Giy G l.v 
a 7 3 5 4 0 4 5 

Lye; ihr Thr Thr Ala Arg Leu Leu Ala Mel Ala Vai Giy Gys <;i;\ Giy 

9 0 5 (.1 55 6 0 

V2 Giu Asp Pro Pro Cys Giy Val Cys Pro His Gys Glu Aia Val Gib A.cq 

r! 1 05 70 7=5 HO 

'3 5 Giy Ala His Pro Asp Val Val Asp He Asp Ala Ala Her Asn Asn Ser 

»« " 05 so *-■:.. 

l)H Val Glu Asp Vai Arq Giu Leu Arg Glu Arg He His Leu Mo Pre; Leu 

0 0 .100 .10 5 110 

mi.' Ser Ala Pro Arg Lys Val Phe lie Leu Asp Glu Ala His Met Leu Ser 

;02 115 120 125) 

.10 3 Lys Ser Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu Pro Pro Pro 

:05 .1 30 135 14 0 

.107 v'.r. Val Leu Phe Val Phe Aia Thr Thr Glu Pro GLu Arg Iter. Pro Pro 

108 H5 150 lii'fi Hit: 

110 I In lie Leu Ser Arg Thr Glu His Phe Arg Phe Arg Arg Leu ihr Gl u 

IM 16 5 17 0 17 5 

'113 Glu Glu lie Ala Phe Lys Leu Arg Arg He Leu Glu Ala Val Giy Ary 

11 4 180 10 5 190 

llf: Gi'.i Al.a Glu Glu Glu Ala Leu Leu Leu Leu Ala Arg Leu Ala A. ; *p ci.y 

11" 3 33 200 205 

1 lv Aia Leu Ary Asp Ala Glu ser Leu Leu Glu Arg phe l e.n Leu I v. a Glu 
3 20 2.10 215 220 

122 Giy Pro Leu Thr Arg Lys Glu Val Glu Arg Ala Leu Giy Ser Pro Pro 

I2 7. 225 230 2^5 240 

125 Gl.y Thr Giy Val Ala Glu He Ala Ala Ser Leu Ala Arg Giy lys Thr 

126 24 5 250 255 



ioRO 
1 4 4 0 

1 5 0 0 
15t.i0 

i;,2 0 

lfi£0 
i"4 0 
18 00 
18 h0 
.10 20 
i 9 f5 0 
200? 
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RAW SEQUENCE LISTING DAJ.'tf : .12/07/^000 

PATEN'I APPLICAJ VON : US/09/7 16 , 964 VIM 11 : 0 7 :' 50 : 0 3 

r n put Set : A:\Rl03 01.app 

Output. Set: N:\CRF3\12072000\l716964.raw 

LjH Ala Giu Aid Leu Gly Leu Ala Ary Arq Leu Vyr Gly Glu Gly Tyr Ala 

Jf»ti 1^7 0 

t-:'i. Pro Arq Ser Leu Veil Ser Gly Leu Leu Glu Val Phe Arq Glu Gly Leu 
.1 ; j v 5 1 J 3 0 > 8 5 

I 4 Jyr Al a Ala ehe Gly Leu Ala Gly Vhr Pro -.tin Pro Ala Pro Pru Gin 

i3~> j-su :i9 5 :*oo 

X " J A J . a ; . e u i 1 e a .1 a a 1. a M e t. i It r A 1 a i.eu A s p G .1 u Ala net. Glu A r q j.. <• ■. 1 t 
1 : ■•: '< 0 5 310 315 3 ^ '.J 

l)q Altj Arq Arq Ser Asp Aid Leu Ser leu Glu Vu "!. Ala I.eu re.u Glu Ala 
.Hi " " 31>5 330 335 

1 ■ ! 3 G i ■ A j: q A la L e. t ) A 1 a A la Glu Ala Leu P r o Gin 1-* r o Ihr G 1 y A 1 a P r o 
14 4 54 0 34 5 3 50 

I-}-;; ;-;er Pro Glu Val Gly pro Lys Pro G.Ui Ser Pro Pro J.'Jrr Pro Glu Pro 
'! 4 V 3 : o 3 f> ». ) 3 h 5 

14 9 Pro Art j Pro G.1 u Glu Ala Pro Asp i.eu Aiu Glu Aiq l'rp Arq Ala Phe 
.L5o <?0 ''7 5 3«u 

Lou Giu Ala Lei: Arq Pro Ihr Leu Arq Ala Phe Veil Arq Glu Ala Ary 

15 3 5b5 39 0 jN? 400 
1 f> 5 P r o G In Vci 1 A r q G 1 u G I y Glu Leu i : y .s Leu A J a P I m P r o G 1 u A s p I .y s 
15 0 4 05 410 415 

i~8 Ala Phe His ryr Arq Lys Ala Ser Glu Glii Lys Veil Arq leu Leu Pro 

I5»i 4 20 A 2 ft 4 30 

l5>.t Leu Ala Gin His Phe Gly Val Glu Clu Vn'l Va 1. I.e.; Va 1 l.uu Giu 

lr.;: 4 3 5 440 > 4 5 

.! 0 4 Gly GMi l.ys r.-ys Ser l.eu Ser Pro ?\rq Pro Arq Pro A.U; Pro Pro Pro 

163 4 50 4 55 4 60 

10" Glu Ala Pro Ala Pro Pro Gly Pro Pro Gin Glu Glu Val Glu Ala Giu 
IbH 4 6 5 4 70 4 7 5 4 HO 

'17 0 Glu Ala Ala Gr;t Giu Ala Pro G.lu Glu Ala Leu Arq Arq Val Val Arq 
171 4 85 4 00 495 

"J?3 Leu Lea Gay Glv Arq Val Leu :.rp Vai Arq Arq Pro Arq Thr Arq Giu 
1 74 5 >10 50 5 510 

! 7 *'■'> a 1 a P r o G .1 u G J a C 1 u Pro Leu Ser G 1 n Asp Glu He G 1 , y G 1 y T 1 1 r o 1 y 
17" 515 520 525 
17 0 ile 

183 G'M0-> ST.Q TL WO: 3 

184 OVi"; LENGTH. J59 0 
1155 <2\2> IYPL: fNA 

< 2 J. 3> ORGANISM: Thermos the rmoph i In s 
;1HH <:4u0> .'JKQIJKNCK: 3 

lcV* qrqaqcqccc LoUJOCgocy cVtccycccc uLoaeotLcc agqagqtgqt qgqqoaqqaq 60 

TOO cacqf.qaaqq viecooLeet eaagyecate cqgyagggga ggctcgcf:ca qqoctacct c 120 

l'J.i t;tctccqc;qo rcnqqqgcqt qqueaaqacc aecaeqqeqa qgctcoLcqo eatqqcqqr q 1H0 

1. 'j 2 q a g I". q c e :< q q f j q ' j ^ :i q a c c : o e •:: c J . L" q q q q q i . e t;q e c c o i: a o t g e e a q y e qqt.qcagagq i 4 0 

1S3 qqcqceeacc eqqacqiqqL'. qqacatv.qac qecqceaqoa acaactccyt. qqaqgacglq ^00 

194 cqqqaqor.qa qqqaaaqqaL ooacotcqcc ccoctctctq eccoeaqqaa qgtchr.catc 300 

19 5 ctqqaeqaqq cceacatqcL etecaaaagc qeo Ltcu^cq rccLertieaa gacoctggag ^120 

n*0 qagceeocqe ccoacgtrict. c< LcqtcLtc qcenccaeoq aqccegaqag g.j Lgcccocc 480 

10 7 a c o d L c c L c L e c c j c a e c c ci goacLLoc g c L Le c g c g c e Lc a c y g a y g a gyay a L c y c c 5 1 0 
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RAW SEQUENCE LISTING 

PATENT APPLJ.CAJ.10t4: US/09/716,964 



I>AJ.h: : I 2/0~/^0o0 



iiipul St*-1;. : 
Output. SrM; 



A:\Rl0301.app 

N:\CRF3\12072000\I716964 .raw 



193 

20 U 
20! 
2 0 2 
2 '.Hi 
J-: 0 4 
20 5 

::o6 



:•: ! o 

2 }. L 



21 .» 
2 3. a 
2ig 

2 LiU 
221 



22n 
230 

2 3.1 

J 33 
234 



J 4 0 
241 
24 i 
24 5 
246 
248 
;;4 U 



LtJ aagctoe qqcqea UvM. gqaqqccqi g qqq< *q<-iq ^qq 

eLeelcgece gcctggcgga cggggccuLt agggacgogg 

CLCCLCCtqq iiJqqCCCCCL C(l CCOq Ui;i a CJ qaqqt.qqaqO 

auqaccgqqg t gqrcgaga <: rg-.;rgr;r;tc." vTLcqewagqg 

qqcctcqcr. c g q c q c o ' . f : 1 . a o q g q q a a g g q i a r : q o t * c: c g a 

i . l.qgagqtgl. Lmruqgaaqg cci;c i.ucqo.- ..;<vi; I Lcqqcc 

i j c c c o q c c c c a g q c c <: l . q a L o q o r; q c c a 1; q . i r. c q c c v L* q q 

urccqcrqc L rcqacqrcT.t. aaqfv:t.qqaq q i.qqoi;':?.;of: 

g c c qc c ga q q rc:t.acc c ■ j n q c c c a c q g 



q d a a q c c cf ' c a i a o o c < ; g g a a o c; t : c c a a q q 
L q q eq q q CO L L r: c r. c q a q q c f ; c !". f ; a q g r r : r; 



qctcc t. I ccr 

• ' rc q ; - gq^gg 

.-.CC* Lscq qq 



nrqqaqq tec: qqqaagqcca 'SCiX't'icrm: qci:t rccccq 

r q <: a a g q o c t r. g g a a c a q a a q g t q a q g c t c c c. c c c. 1 ; q q 

q a q g a q q t . c q tcctcjtc o t q q a q q q a ( r a a a a a a < i a a i \ c 

ifci-ccdrac rLqaaqcqc;: rqc^ccrr^q iq<.:':(:i. cccq 

qaaqcqqcqq aqq?i gqci.-f.'i.* q qa q q a qcj r '.* i t; q Mqqcq q q 

t: q q q t q C LC t; q q q Z q c q q C <J g C C C a q q a C C q q qa q q C q C 

c o a q a i v i : a g a Lao q g q q r. a c I y q t a t. a t. a a 
:210> SEC' TO NO; 4 
<211> LENGTH: 4 6 4 
<2 4 2.> TVPV:: PKT 

•:213> ORGANISM : Tis^r^us i.Liormopil i j.ns 
<4u(J> SE0UEW:.:F: : 4 
Met". S<*..r Ala Leu Tyi Arg Arg i^he 



( J L"0 L.'2 li 

1 0 



VaL G3.y uln Glu His \'al Lys G ' a Pro L«u U?tj 

L-0 2~, 

Giy Arq T.eii Ala Gir> Ala Tyr r.«u i/'m ; : Ser Gly 

!: 5 4 V 

Lys rhr Thr rhr Ala Arg Leu Leu Ala Her. Ala 



:-0 G.itl rsp Pro P I t 



Gl T 



Pro His Cvs 



Asp Ala 
g t ! 



G Ly Alo His Pro Asp Va i. Vai Asp 

>.t-:> 

Val. GJ.u Asp Vai Arg G i u T.eu Arq GJu Arq TI.G 

.: 0 0 . 1 0 5 

sor Aia Pro Atq Lys VaJ Pi:-., ii 

1.15 12 0 

f.ys Sor Ala Pho Asn A.ia I'.^-mi Lou l.ys "J.iu 



i; Asp Gin 



130 



13 5 



!is VaJ Loiti Phc Val j'he A!. a [Jir Tnr oiu Pro 



rqqagqaqga 
aadgcotcct 
qo«jccct.aqq 
qg a aaacggc 
qgaqcctqqt 
t c q t: g g g a a c 
•it :qa qqcca L 
t.ggdggrggg 
caqaqqrcqg 
cq*,;v:cqa ccL 
coUt.cgLgcq 
a qq a ca a qtj c 
ccoaggccca 
t-.ciaqcccaag 

dqqaqqaqq t 
1. qq t.CC q CO t 

eggaggagqa 



qqofir l:t!Ct.r; 
ggagogctLc 
ctccccccca 

qqaqqc^Ctq 
or.cg qqccf.l. 
occectLeoc 
q qaqc.q cctc 
a.-iaqqccctq 
occcaaqccq 
qcqgqaq<;qq 
<iqaq qcccqf : 

;;Li:ccact;.io 

t Itcqqqqtq 
qcrccuccrq 
aqaqqcq qaq 
i;qqqqqqq 

accccuqaqc 



6 00 
ii o o 

7 2 0 
" l l 0 

8 JO 

9 0 0 

'J r, li 

! 0 2 0 

10 80 
i 1 4 0 
1200 
1 2 f> 0 
.L3.V0 
'f .-;ho 

i 4 4 0 

i r.oo 

L 5 6 0 
5 590 



I!ir 

Lys 

Pre- 

Va 1 
(■o 
G 1 u 

Ala 

tUs 

Ala 

Glu 
.1 4 0 
Gin 



PI.-; Gin 

Ala lie 

?0 

Arq Giy 

4 5 

GJ.y Cys 

Ala Val 

Stir Asn 

Loti A* a 
110 

His Met 
:12 5 

Glu Pro 

Arq Mer. 



14 5 



1'3 0 



i 5 5 



^54 ihr lie Leu lier: Arg i'hr t«iij His i J hs Arg Plu-j Arq Arg L£u 



1 1) : 



I/O 



25 
260 
2 6 1 



Glu Glu lie Ai.a Phe Lys Leu Arg Arg He I.eu Glu 

i 2 0 i ;4 5 

C.lu A ia G'iu Glu G.!u Ala Leu l eu Lou Leu A] a Arg 
195 200 



Ala Val 
190 
Leu Ala 
20 5 



Glu val 

I a 

Arq Glu 
Val Giy 
Gin Gly 
Gin Arg 

y o 

As n Si^r 

0 5 
Pro la-j 

Leu Sftr 

Pro Pro 

?to Pro 

ie;o 

lhr Glu 
Gly Arg 
Asp Gly 
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RAW SEQUENCE LISTING 

!•• A i' L H 'I A PPL'ICA'J ION : US/09/7 16, 964 



DAJ. J::: .1 2/0 1/10 0 0 
riMK: 07:50:03 



input Set : A:\R10 301.app 

eurput Set: N:\CRF3\12072000\I716964.raw 

Ala Leu Arq Asp Ala Glu Ser Leu Leu Glu Arq Phe Leu Leu Leu Glu 



22 0 



Glv Pro Leu !hr Arcj r.ys Glu Val Glu Arq Al a Leu Gly Ser Pro Pro 



2 <0 



2 5 



24 0 



c.iy rii:: Gly v?il Ala Glu lie-- Ala Ala Ser Leu Ala Ar^ Gly Lvs Thr 



Arq 



:;-4 5 
Let.! G i.y Leu 

Leu val Her 



!.a" Arq Aru i.eu i.y r Gl 



Glu Gly iyr Ala 
:;05 270 

l .e u Leu G 1 u v i j 1 j." h e A r g Glu G 1 y J „ e 1 1 
280 235 



Ala Phe Gly Lou /-ia c; L; 

2u r ) 



i'hr ho Leu pro Ala Pro Pro Gin 

•■too 



Ala i.eti lie Ala Ala Met Liu: Ala Leu Asp Glu Ala Met Glu Arq Leu 



310 



Arq Arq Ser Asp Ala Leu s«r Leu Glu Val Ala Leu Leu Glu Ala 



;.i0 



3 3! 



cViv Arq Ala Leu Ala Ala Glu Ala Leu Pro Gin Pro Thr Gly Ala Pro 



LIO 



3 -i 5 



5 0 



Ser Pro Glu Val Glv Pro T.vs Pro Glu Ser Pro Pro Thr Pro Glu Pro 



3 6 0 



36? 



Pro Arq Pro Glu Glu Ala Pro Asp Leu Arq Glu Ary Trp Arg Ala Phe 

3*0 3 7 5 380 

Leu G !,u A.ia Leu Arq Pro Thr Leu Arq Ala Phe Va l Arq Glu Ala Arq 



3 a r 



:j v o 



•l 00 



Pro G \ u Val Arq Glu Gly Gin Leu Cys Leu Ala Phe Pro Glu Asp Lys 



4 0 5 



41 0 



4.15 



M i Phe iLis Tyr Arq Lys A L n .Ser GJ.u Gin Lys Val Arq Leu Leu Pro 



4 20 



43 0 



Leu Ala Gin /.la His Phe G.Lv Val Glu Glu Val Val Leu Val Leu Glu 



G 



4 4 0 



0 45 



Lys Lys Pro Glv p^o Lys Ala Pro Pro Gly 
4 5 5. 4 60 



Pro Thr Ser 



; 150 455. 

; -*:jUj> .SFQ I I; NO : = 

: » -;^11> LKNG'rll : -i54 

' 012> IVPL: PR1 

.. 2 1 '.->> OPG A >j ISM : T h e rm 1 s L h e nnoph i. J. u r 

» *i00> SLQURNGM : 5 



Ser Ala Leu Tyr Arq A.'.q Phe Aig Pro Leu Thr Phe Gin Glu Val 



!. 



10 



.15 



va i. Gly Gin Glu P. is Val Lys Glu Pro Leu Leu Lys Ala He Arq Glu 

20 25 3 0 

G 1 y A r g L e u A, 1 a Gin A .1 a T y r Leu P h e 3 e r Gly Pro Arg Gly Va 1 G 1 y 

3 5 4 0 4 5 

Lys Thr l:..r Vhr Ala A rq Leo Leu Ala MeL Ala Val Gly Cys Gin Gly 

50 5 5 6 0 

G 1 . u A s p P r o P r o C y s Gly Val Cys Pro His Cys Gin Ala Val Gin A r g 

6 5 70 7 5 80 

Glv Hi.s Pro Asp Val val Asp lie Asp Ala Ala Ser Asn Asn Ser 

US 90 95 
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<210> 89 
<211> 182 
<212> PRT 

<213> Deinococcus radiodurans 
<400> 89 

Pro Trp Pro Gin Asp Val Val Val Phe Asp Leu Glu Thr Thr Gly Phe 
1 5 10 15 

Ser Pro Ala Ser Ala Ala He Val Glu He Gly Ala Val Arg He Val 
20 25 30 

Gly Gly Gin He Asp Glu Thr Leu Lys Phe Glu Thr Leu Val Arg Pro 
35 40 45 

Thr Arg Pro Asp Gly Ser Met Leu Ser He Pro Trp Gin Ala Gin Arg \ 

50 55 60 . 

Val His Gly He Ser Asp Glu Met Val Arg Arg Ala Pro Ala[Xaa)Lys yjJO( /J^^ ' 
65 70 75 \_J 80 



Asp Val Leu Pro Asp Phe Phe Asp Phe Val Asp Gly Ser Ala Val Val 
85 90 95 

Ala His Asn Val Ser Phe Asp Gly Gly Phe Met Arg Ala Gly Ala Glu 
100 105 110 

Arg Leu Glv Leu Ser Trp Ala Pro Glu Arg Glu Leu Cys Thr Met Gin 
115 120 125 

Leu Ser Arg Arg Ala Phe Pro Arg Glu Arg Thr His Asn Leu Thr Val 
130 135 140 

Leu Ala Glu Arg Leu Gly Leu Glu Phe Ala Pro Gly Gly Arg His Arg 
145 150 155 160 

Ser Tyr Gly Asp Val Gin Val Thr Ala Gin Ala Tyr Leu Arg Leu Leu 
165 170 175 

Glu Leu Leu Gly Glu Arg 
180 



(fh £vW 



f2C 

Please Note: 

Us of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 



VERIFICATION SUMMARY DA 11'.: .1 2/0 ' / J 00 0 

PA 1 ! : N T AP P L i C A J I ON: US/0 9/716, 964 H f-J 32 : ■') : fi A ; ; : • -1 

input. Si-st : A:\R10301.app 

vut put S;u:: N:\CRF3\12072000\l716964.raw 



L 


10 M 


:2 7i) 3 


; ( 


;.. 


J 7 M:27 


i C 


: ( 


i. 


3 :-. J 


■i : L 


3 ; i 


W: 


i. 




^ ; 2 


33 


*j : 


i. 


3 3 2 


■!: 'J4 3 






Lv32 


M : 


23* 


W : 




.1 032 




V. : 


! 


.! 03 2 


1-1 : 


34 0 


W: 


j. 




i 'l : 




W : 


L 


J'>74 


tf : 


23o 


vv : 


:. 


j.O 7 i 


M: 34 0 


v! : 




! 2 24 


M : 3 4 i 


W: 




.1 j -i :■: 


N : 


34i 


W : 


T. 


i 3 33', 


M : 


2'»K 


W : 




3 3 03 


: 


2 V ^ 


W : 


!.. 


i. 3 06 


'A . 


-: .1 0 


Vv : 


L 


1319 


M : 


2' 5 3 


v: 


L. 




!•] : 


2 3>, 


W : 




i J .1 'J 


H : 


.Mi) 


W : 


L 


i.Z-12 


M : 


238 


W : 




j 3 32 


"M : 


2 38 


W: 


v. 


I j 32 


M : 


m:j 


W : 


!. 


•j] r-9 


H : 


2 33 


v: : 


L 


21o3 


M: 238 


W : 


;.:*.! I 3 9 


; 1 . :> r i b 


V/ : 


V. 


2133 


.H- 


23 a 


W: 


3 


2H>';' 


W : 1 4 0 


lv : 


l: ^2 32 


M: 


2:?b 


K: 


v. 


22:^ 


M : 


25 » 


>? : 




2 2 3 :-: 


M ' 


iM'j 


W . 




2232 


M : 


>33 


S. 


r. 


-22 33 


H : 


.< 1 0 


W * 


3 


22 3 3 


t -I : 




w. 


1. 


22 Vr 


M : 


2 3*; 


w 




2255 


I-i : 


2 ■-; 6 


( ,: 




j235 


M : 


2 SB 


w 


M 


0 




0;: 3t;d 



Mandatory Feature m 
Mami.-ifory i.3^aturo n.i 
(43) "n" or "Xtia" u 
H.'jmU.j t.orv ! : v-. (J Lure 
Mandatory feature 

Mandatory L'o-ifure 
Mandatory f oat. tire 
< 13 } ,! n " or "Xaa" 
(43) "n" or- "Xari " 
(-!'.) "n" or ".via" 
M a n u a t . o r:y I- 1 o a 1 . u r o 
Manda tory f'sM LL:rt* 
(43; ' n*' or "X-:irr' 
!• j :-i n ( I ci i: o r ■ y t ' e a t: u re 
M -i ; 1 d d f o r y f.\i a t u r e 
(46) "rt" or: ;, X-aa" 
Manda!. of v Ferfi ure 
H i-j n d f 1 1. o r ■ \ ' P e at t rr * » 
ilf; "n" or "x«a" 



Mandator'.' K'.^ t. u i<:- 
Maiiiia t. o r y f-' e - a r. ' ; jr. 
-3.1 "ri"*or "X;;a" 
M c: n d a i, o r y r ' t? a t u r 
M u 1 1 d c t o r y f ' e a I:, u /: 
H a n 1 ;j r. o r y t e a 1: 11 r e 
i4a;idav.ory f'sfti.uro 
i iCi ) 'n" or ;! Xaa" 
M t : n .' J ,■ j t*. o r ■ y ; • ; .* :-j i :. 1 1 n j . 
Ma 1 1 d a 1; o r y i-'e a 1:. u re 
M a j i ; i a 1 : o r y F e a v. n.re 
M a u ; i c' r f "> c y P . ; t u r e 
:'.t;d in SeqNO"9 L 



numbe r, d i. i." f er:.^ , Rc?placed Appli«'*a L int: Number 
d j. l 1 : o r s , l o p i a c c d Current l" ' i 1 i. n y Date: 

sriiiKj, <22'\j- riot found for SCO U.n? ; 1 7 
issi.nq, <222> not found for Si-JO JLJ.^:!" 
s<vd: rcit.uro requXr^cl, lor sr20 1 f.< : 
miss i.nij - <22]> not found tor Si-.') I'on ; 2\< 
mi.:ryirjq, -:222> not. round for Si.<3 1 U ^ : 2: ') 
used: Vca ui n*. rf*qu i r<-'.d ( for Sf:0 13^:2'.; 
missinq. -^221> not: found for Sh:Q UU:30 
n;.^s.tria, <222> uof found tor 'Al'<j IDr-:.'S0 
■jsed: J-'oaturo requi.rt-:d, for SL'2 rDr:'iU 
u«rfd, for SIX) IDE :42 
utz-od , for StO I US : 1 3 

niissifK;, <.22.i> not found for SFfj. Jl.'s : ;» 
iiiissinq. :.^22-- not found for SD..J JD^:33 
u e d : V f? a 1 1 . 1 r e r * qui. rv. ci . f o r S p; w I i j ■ i : t > 
ini y s i n a , ; 2 2 .1 > n o t found for S 1 1 ? " ; 
01 is sinn, <2 22> not found for Hi:o lDf«:-j 7 
used: Peafurp rnqui red., for Sh'O nH-:fi7 
ni i s s i 1 i q t < 2 2 J > not. f o 1 : n d f o r S \: t.t ID ? : i : . 
it: i. s s i. n q , < 2 2 2 > : 1 or f on nd for S I:" 0 T Y> * : 6 ^ 
urfed : f ! fvav.iir« raqu.ired, tor .SCO Ttfr:6K 
ini:;siiiq. <?.2C> not found for StlQ I.fn:H9 
miyrii.nq, -..2 2 1:-- not. found for SCO LDi :?.■'} 
rai.ss.liKj, :22;^ not found for SKO TD=:>; l i 
Hi.L.^sfiaq, <223> not found for STQ ll)v.:t\': 
u d : Foa 1 1 f ro r o q u i red , for S DO T D : « 3' 
mi^aiuq. <?20> not found for sno I.D =:■*•*! 
m i s s i ii 9 , <*2 .! not f o and f or S 1 0 if : c .) 1 
ni i s s i n g , 2 2 2 n o 1' fan nd f or S tX> 1 o k : v .1 
mis^inq, -'22;:> not found for Sl';o iDf?:'M. 
u s o d : F « a r. u v<? r q u i r e d , for S F: o T . n >i : 0 . 1 
uiisding, <2 20> nor found tor SEO T.D&::'! 
missing, <22 i> not found for S€Q LOt^-i 
m i s s i 1 1 q , < 2 2 2 n o t: f o u nd f o r S f'O 1 2- « : 1 ; 1 
missing, <^23> not found for SCO 11:-^ : 3> 1 
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